How Many Is Enough?—Statistical Principles for Lexicostatistics

Authors

  • Menghan Zhang
  • Tao Gong
Abstract

Lexicostatistics has been applied in linguistics to inform phylogenetic relations among languages. There are two important yet not well-studied parameters in this approach: the conventional size of vocabulary list to collect potentially true cognates and the minimum matching instances required to confirm a recurrent sound correspondence. Here, we derive two statistical principles from stochastic theorems to quantify these parameters. These principles validate the practice of using the Swadesh 100- and 200-word lists to indicate degree of relatedness between languages, and enable a frequency-based, dynamic threshold to detect recurrent sound correspondences. Using statistical tests, we further evaluate the generality of the Swadesh 100-word list compared to the Swadesh 200-word list and other 100-word lists sampled randomly from the Swadesh 200-word list. All these provide mathematical support for applying lexicostatistics in historical and comparative linguistics.
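The abstract does not spell out the two principles or the statistical tests, so the following is only a loose sketch of the sampling comparison it mentions: drawing random 100-item sublists from a 200-item list and looking at how the share of cognates varies across draws. The cognate judgments, the function names `cognate_share` and `sampled_shares`, and the figure of 120 cognates are all hypothetical and are not taken from the paper.

```python
import random


def cognate_share(judgments):
    """Fraction of list items judged cognate (1) rather than non-cognate (0)."""
    return sum(judgments) / len(judgments)


def sampled_shares(judgments_200, sample_size=100, n_samples=1000, seed=0):
    """Cognate share in each of n_samples random sublists drawn without replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    return [
        cognate_share(rng.sample(judgments_200, sample_size))
        for _ in range(n_samples)
    ]


# Hypothetical data: 200 made-up cognate judgments for one language pair,
# 120 of them marked cognate. Real judgments would come from a wordlist study.
judgments_200 = [1] * 120 + [0] * 80

shares = sampled_shares(judgments_200)
mean_share = sum(shares) / len(shares)
print(f"full-list share: {cognate_share(judgments_200):.2f}")
print(f"mean share over random 100-item sublists: {mean_share:.3f}")
print(f"range over sublists: {min(shares):.2f} to {max(shares):.2f}")
```

A narrow spread across sublists would suggest that a 100-item list carries much the same signal as the 200-item list for that pair; whether this matches the paper's own test is an assumption.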

Similar resources

History of Polynesian Languages

We have been using the term “Polynesian languages” implying that these languages belong in a single group and that they are somehow related. But what does it mean to say that languages are related? What is the basis for the assumption that, say, Hawaiian and Samoan are related? There are two major methods that are relevant to these questions: the comparative method and lexicostatistics. The com...

Department of medical education; A personal history

This is a brief overview of the history of formal introduction of the art and science of education into the sphere of medical education in Shiraz. Before this introduction medical education was, and in the majority of other institutions world-wide still is, a simple transfer of knowledge from teacher to student. The students accepted their passive role because this was how they had been taught a...

Review of Statistics in Historical Linguistics Stephen Grimes

Introduction: In this 1986 book, Sheila Embleton surveys the development and use of statistical techniques in historical linguistics. She argues that any successful lexicostatistic model must incorporate word borrowing rates. A computational model for family tree reconstruction using borrowing is introduced, building on ideas of Sankoff (1972). Embleton outlines future uses of lexicostat...

Legal and Ethical Principles of Criminalization in Iran’s Criminal Law

Background: The criminal code is the set of rules that restricts a person's rights and freedoms to ensure peaceful coexistence. What behavior should be prohibited, and what can be removed from the circle of legal acts? How can the word ethics in the world of law refer to ethical and literary means from the past, and is called the tradition of morality, in the sense of moral standards? On the b...

Educational Needs Assessment of Mashhad Dentists about Principles of Bonding and Adhesives in Dentistry

Background: Many composite filling problems and failures are due to dentists' inadequate and passing acquaintance with bonding principles. The aim of this study is to assess dentists' acquaintance with bonding principles and the different adhesive applications that are going to be used in educational programs. Methods: In this descriptive, cross-sectional study, a valid and reliable qu...

Journal title:

Volume 7, Issue

Pages -

Publication date: 2016